Unsupervised Accent Modeling for Language Identification
Abstract
In this paper we propose to cluster iVectors to model different accents within a language. The motivation is that not all speakers of the same language share the same pronunciation style. This source of variability is not usually considered in state-of-the-art language identification systems, and we show that taking it into account helps. For each language, the iVector space is partitioned according to the similarity of the iVectors, and each cluster is treated as a different accent. A simplified probabilistic linear discriminant analysis (PLDA) model is then trained on all the accents, and at test time each utterance is evaluated against all of them. The highest score for each language is used to make the decision. The experiment was carried out on 6 languages of the 2011 NIST LRE dataset. For the 30 s condition, the relative improvement over the baseline was 11%.
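The pipeline in the abstract (cluster each language's iVectors, treat each cluster as an accent, score a test utterance against every accent, keep the best score per language) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say which clustering algorithm is used, so plain k-means stands in for it; cosine similarity to the cluster centroid replaces the simplified PLDA scoring; the iVectors are synthetic 4-dimensional toy data; and all function names are hypothetical.

```python
import numpy as np

def kmeans(x, k, iters=20, seed=0):
    """Plain k-means on the rows of x; returns the k centroids."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)]
    for _ in range(iters):
        # Assign every vector to its nearest centroid (Euclidean distance).
        dists = np.linalg.norm(x[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Recompute centroids; keep the old one if a cluster went empty.
        for j in range(k):
            if np.any(labels == j):
                centroids[j] = x[labels == j].mean(axis=0)
    return centroids

def train_accent_models(ivectors_by_language, n_accents=2):
    """Cluster each language separately; each centroid stands for one accent."""
    return {lang: kmeans(np.asarray(vecs, dtype=float), n_accents)
            for lang, vecs in ivectors_by_language.items()}

def cosine(a, b):
    """Cosine similarity, used here as a stand-in for PLDA scoring."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def identify_language(ivector, accent_models):
    """Score against every accent, then keep the best score per language."""
    scores = {lang: max(cosine(ivector, c) for c in centroids)
              for lang, centroids in accent_models.items()}
    return max(scores, key=scores.get), scores

# Synthetic toy data: two languages, each with two "accents" (assumption).
rng = np.random.default_rng(1)
basis = np.eye(4)

def toy_accent(center, n=20):
    return center + 0.05 * rng.standard_normal((n, 4))

train = {
    "A": np.vstack([toy_accent(basis[0]), toy_accent(basis[1])]),
    "B": np.vstack([toy_accent(basis[2]), toy_accent(basis[3])]),
}
models = train_accent_models(train, n_accents=2)
predicted, per_language_scores = identify_language(basis[1], models)
print(predicted, per_language_scores)
```

Taking the maximum over accents means an utterance only has to match one accent of a language well, rather than an average over all of that language's speakers.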
Similar resources
Use of Trajectory M Automatic Accent Cl
This paper describes a proposed automatic language accent identification system based on phoneme class trajectory models. Our focus is to preserve discriminant information of the spectral evolution that belongs to each accent. Here, we describe two classification schemes based on stochastic trajectory models: supervised and unsupervised classification. For supervised classification, we assume te...
Improvements in Non-Verbal Cue Identification Using Multilingual Phone Strings
Today’s state-of-the-art front-ends for multilingual speech-to-speech translation systems apply monolingual speech recognizers trained for a single language and/or accent. The monolingual speech engine is usually adaptable to an unknown speaker over time using unsupervised training methods; however, if the speaker was seen during training, their specialized acoustic model will be applied, since ...
Unsupervised Learning of Tone and Pitch Accent
Recognition of tone and intonation is essential for speech recognition and language understanding. However, most approaches to this recognition task have relied upon extensive collections of manually tagged data obtained at substantial time and financial cost. In this paper, we explore unsupervised clustering approaches to recognize pitch accent in English and tones in Mandarin Chinese. In unsu...
A comparison of two unsupervised approaches to accent identification
The ability to automatically identify a speaker’s accent would be very useful for a speech recognition system as it would enable the system to use both a pronunciation dictionary and speech models specific to the accent, techniques which have been shown to improve accuracy. Here, we describe some experiments in unsupervised accent classification. Two techniques have been investigated to classif...
Implications of Prosody Modeling for Prosody Recognition
This paper introduces Stem-ML, which is a model of the prosody generation process with an associated description language, and suggests how it may help prosody recognition. We applied Stem-ML modeling to three topics: the modeling of prosodic strengths, intonation types, and noun phrase patterns. Stem-ML parameters derived from F0 contours may have a more consistent relationship with prosodic ...
Publication date: 2014